Optimizing Irregular Adaptive Applications on Multi-threaded Processors: The Case of Medium-Grain Parallel Delaunay Mesh Generation
Abstract
The importance of parallel mesh generation and the growth of simultaneous multithreading (SMT) architectures raise the question of how to adapt parallel mesh generation software to SMT architectures. In this work we focus on Parallel Constrained Delaunay Mesh Generation and explore medium-grain parallelism at the sub-domain level. The approach targets commercially available SMT processors. Our goal is to improve the performance of the existing MPI-based parallel mesh generation software (PCDT) by exploiting multi-threading inside a single SMT chip. This report presents parallel mesh generation software based on medium-grain parallelism, which we developed on top of the existing PCDT program. By extending PCDT instead of creating completely new software, we reduced the development complexity and achieved 100% code reuse. Experimental evaluation shows that using different contexts of an SMT processor can improve the performance of parallel mesh generation software. However, the medium-grain approach suffers from significant synchronization overhead caused by different threads working on the same sub-domain. This work makes two general contributions. First, we extended the coarse-grain parallel mesh generation software (PCDT), combining the coarse-grain approach with a medium-grain approach. Second, we significantly improved the performance of the PCDT software using the optimizations developed for the MPCDT code and the SMT architecture. These changes made PCDT faster than Triangle, the best publicly available 2D Delaunay mesh generation software, when executed on a single physical SMT processor.
Similar resources
A multigrain Delaunay mesh generation method for multicore SMT-based architectures
Given the proliferation of layered, multicore- and SMT-based architectures, it is imperative to deploy and evaluate important, multi-level, scientific computing codes, such as meshing algorithms, on these systems. We focus on Parallel Constrained Delaunay Mesh (PCDM) generation. We exploit coarse-grain parallelism at the subdomain level, medium-grain at the cavity level and fine-grain at the elem...
2D Parallel Constrained Delaunay Mesh Generation: A Multigrain Approach on Deep Multiprocessors
Parallel Constrained Delaunay Mesh (PCDM) is a 2D adaptive and irregular meshing algorithm. In PCDM one can explore concurrency using three different levels of granularity: (i) coarse-grain at the sub-mesh level, (ii) medium-grain at the cavity level and (iii) fine-grain at the element level. The medium- and fine-grain approaches can be used to improve the single-processor performance of coarse-g...
Towards Exascale Parallel Delaunay Mesh Generation
Mesh generation is a critical component for many (bio-)engineering applications. However, parallel mesh generation codes, which are essential for these applications to take the fullest advantage of high-end computing platforms, belong to the broader class of adaptive and irregular problems, and are among the most complex, challenging, and labor-intensive to develop and maintain. As a result...
Parallel Two-Dimensional Unstructured Anisotropic Delaunay Mesh Generation for Aerospace Applications
Juliette Kelly Pardue, Old Dominion University, 2015. Director: Dr. Andrey Chernikov. A bottom-up approach to parallel anisotropic mesh generation is presented by building a mesh generator from the principles of point-insertion, triangulation, and Delaunay refinement. Applications focusing on high...
High-Level Parallel Programming of an Adaptive Mesh Application Using the Illinois Concert System
We have used the Illinois Concert C++ system (which supports dynamic, object-based parallelism) to parallelize a flexible adaptive mesh refinement code for the Cosmology NSF Grand Challenge. Our goal is to enable programmers of large-scale numerical applications to build complex applications with irregular structure using a high-level interface. The key elements are an aggressive optimizing compil...